Exploiting Domain Thesaurus for Medical Record Retrieval
نویسندگان
چکیده
InfoLab at the University of Delaware participated in the TREC 2012 Medical Records Track. This paper explains our method and describes experiment results. One limitation of existing keyword matching based retrieval functions is the problem of vocabulary mismatch. To overcome this limitation, we propose to first map topics and visits to bags of concepts using domain thesaurus, and then model the relevance based on the similarities between those concepts.
منابع مشابه
Construction of a Condensed Thesaurus for Building Radiology Ontology
The building of thesauri for large domains, especially for medicine, is a costly affair. However, in many domains thesauri can be constructed on an ontological basis [Wielinga , Schreiber, 2001]. We are developing an ontological information retrieval system for the retrieving of medical records from an electronic medical record system (EMR). We decided to use the UMLS as a basis for building th...
متن کاملSemantic-based Medical Records Retrieval via Medical-context Aware Query Expansion and Ranking
Efficient retrieval of medical records involves contextual understanding of both the query and the records contents. This will enhance the searching effectiveness beyond merely keyword matching and is assisted by analyzing its semantics notion such as by the utilization of the MeSH thesaurus. The query is annotated and expanded by information from the deep medical contextual understanding. This...
متن کاملAutomatic processing of multilingual medical terminology: applications to thesaurus enrichment and cross-language information retrieval
OBJECTIVES We present in this article experiments on multi-language information extraction and access in the medical domain. For such applications, multilingual terminology plays a crucial role when working on specialized languages and specific domains. MATERIAL AND METHODS We propose firstly a method for enriching multilingual thesauri which extracts new terms from parallel corpora, and seco...
متن کاملBilingual terminology extraction: an approach based on a multilingual thesaurus applicable to comparable corpora
This paper presents several methods for exploiting multiple resources in bilingual lexicon extraction, either from parallel or comparable corpora. First, a special attention is given to the use of multilingual thesauri, and different search strategies based on such thesauri are investigated. Then, a method to optimally combine the different resources for bilingual lexicon extraction is presente...
متن کاملMedical Documents Classification Based on the Domain Ontology MeSH
This paper addresses the problem of classifying web documents using domain ontology. Our goal is to provide a method for improving the classification of medical documents by exploiting the MeSH thesaurus (Medical Subject Headings) which will allow us to generate a new representation based on concepts. This approach was tested with two well-known data mining algorithms C4.5 and KNN, and a compar...
متن کامل